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(54) Stereoscopic image generating apparatus and display therefor 



(57) The stereoscopic CG image generating appa- 
ratus and a stereoscopic TV apparatus, has a projection 
transformation section which, based on three-dimen- 
sional structural information describing a three-dimen- 
sional shape of an object, generates a plurality of two- 
dimensional projection models as viewed from a plural- 
ity of viewpoints, a distance information extraction sec- 
tion which generates a camera-to-object distance 
information used for calculations in the projection trans- 
formation section, and a camera parameter determining 
section which, based on the output of the distance infor- 
mation extraction section, the screen size of a stereo- 
scopic image display device for displaying finally 
generated two-dimensional projection models, and a 
viewer's viewing distance, determines camera parame- 
ters so that stereoscopic CG images will be brought 
within the viewer's binocular fusional range. According 
to the thus constructed stereoscopic CG image generat- 
ing apparatus and stereoscopic TV apparatus, proper 
camera parameters (focal length or field of view, camera 
spacing, and converging point) are determined based 
on the camera-to-object distance information, the mag- 
nitude of parallax of the generated stereoscopic CG 
images on the display device (or in a window on the dis- 
play screen), and the viewing distance, so that easy-to- 
view stereoscopic CG images are automatically gener- 
ated regardless of the display size, and by horizontally 
translating left-eye and right-eye images, binocular par- 
allax of displayed images is automatically brought within 
the viewer s binocular fusional range regardless of the 
size of a stereoscopic display used. 
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Description 

BACKGROUND OF THE INVENTION 

1 .Field of the Invention 5 

The present invention relates to a stereoscopic CG 
image generating apparatus for making stereoscopic 
vision possible by stereoscopically displaying two- 
dimensional images generated from three-dimensional 10 
structural information, and also relates to a stereoscopic 
TV apparatus for displaying a stereoscopic image. 

2. Related Art of the Invention 

15 

An example of a prior art stereoscopic image gen- 
erating apparatus is shown in Figure 10. According to 
this apparatus, three-dimensional structural informa- 
tion, describing a three-dimensional shape of an object 
by a surface model, is input (the object is approximated 20 
by a plurality of small surfaces called polygons, and the 
structural information defines the three-dimensional 
positions of the vertices of each polygon and the faces 
and edges formed by the polygons), and the object 
defined by this information is arranged in a world coor- 25 
dinate system. Then, projection transformation sections 
1 and 2 calculate the two-dimensional positions of the 
object that would be projected on a film when photo- 
graphed by an imaginary camera, and rendering sec- 
tions 3 and 4 determine the brightness and color (e.g., 30 
R, G, B values) of an image within each polygon on the 
basis of the material of the object, the type of the light 
source used, and the three-dimensional positions. 

For example, a geometric model of a polyhedron, 
such as the one shown in Figure 1 1(a), is described by 35 
the three-dimensional coordinates of vertices V1 to V8 
and the data structure (forming faces and edges) of the 
geometric model, as shown in Figure 11(b), and the 
object described by this information is arranged in the 
world coordinate system as shown in Figure 12(a). 4C 
Then, an image (vertices) of the object projected on a 
screen 50, as viewed from viewpoint E of the camera, is 
calculated. Next, the positions on the screen of the 
faces and edges formed by the vertices and their bright- 
ness and color are calculated to produce an image for 4t 
output. At this time, in order to produce a stereoscopic 
image, images as viewed from at least two viewpoints 
need to be calculated; therefore, camera parameters 
must be specified as shown in Figure 12(b), that is, 2Wc 
which is the spacing between a plurality of cameras, CL 5< 
and CR which are the positions of the camera view- 
points, P which is the three-dimensional coordinates of 
the converging point of the cameras, and f which is the 
focal length of the cameras (or e which is the field of 
view). 5 

Figure 18 shows an example of a prior art stereo- 
scopic TV. apparatus for displaying a stereoscopic 
, image. 

This apparatus comprises two CRTs with crossed 



polarizing filters attached to their respective display sur- 
faces, and a half-silvered mirror is used to combine the 
two display images. When viewed by a viewer wearing 
glasses constructed from corresponding polarizing fil- 
ters, the images are shown to the viewer's left eye and 
right eye, respectively. 

However, in the above prior art stereoscopic CG 
generating apparatus, the plurality of camera parame- 
ters have to be changed according to the viewing dis- 
tance and screen size, but in actuality, these parameters 
are adjusted by a CG operator, based on his experi- 
ence, by viewing the generated stereoscopic CG 
images and setting the parameters so that an easy-to- 
view image can be presented to the viewer. There is 
therefore the problem that if stereoscopic CG images 
generated with improperly adjusted parameters are dis- 
played on a stereoscopic image display device, the bin- 
ocular parallax of the stereoscopic images (expressing, 
for example, the difference between the horizontal posi- 
tions of the same vertices in the left and eight images in 
terms of view angle) often exceeds the allowable range 
of the viewer, resulting in unnatural stereoscopic 
images that tend to increase eye strain. 

In view of the above problem of the prior art stereo- 
scopic CG image generating apparatus, it is an object of 
the present invention to provide a stereoscopic image 
generating apparatus that can automatically generate 
natural and easy-to-view stereoscopic images for a 
viewer regardless of the viewing distance and screen 
size. 

In the case of the prior art stereoscopic TV appara- 
tus, when the same stereoscopic image signal is input, 
if the screen size is different, the binocular parallax of 
displayed images is also different. Figure 19 explains 
this; that is, binocular parallax As on a small display 
screen (a) increases to ALon a large display screen (b). 
If this binocular parallax becomes too large, the viewer 
will have difficulty in achieving stereoscopic vision, thus 
increasing eye strain. 

Difficulty in achieving stereoscopic vision means 
that, if binocular parallax AN becomes large, and the 
distance between the image display screen and point P 
where the object is perceived for stereoscopic viewing 
increases, as shown in Figure 20(a), there arises a con- 
flict between the adjustment of the viewer's eye lenses 
and the distance perceived by stereoscopic vision, and 
(if P moves further closer) binocular stereoscopic vision 
cannot be achieved. In the case of Figure 20(b), in ster- 
eoscopic images an object at distance 00 is displayed 
with binocular parallax coinciding with the interpupillary 
distance of the viewer. If the binocular parallax AF 
becomes larger than that, the viewer will be unable to 
achieve binocular stereoscopic vision. 

For recent computer graphic terminals, multisync 
; monitors are widespread that can be switched between 
multiple resolution modes. The resolution (display fre- 
quency) can be switched over a wide range, for exam- 
ple, from a low resolution mode of 640 x 400-pixel 
screen generally used for personal computers to a high 
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resolution mode of 2000 x 1 000-pixel for workstations. If 
one multisync display is used to switch between these 
image signals, the displayed size of an image consisting 
of the same number of dots varies according to the res- 
olution of the image signal because the display screen 5 
size is the same. Figure 19 shows this; that is, part (c) 
shows a display of a low-resolution image signal, and 
part (d) shows a display of a high-resolution image sig- 
nal. In part (d), the displayed image is small, while in 
part (c), binocular parallax As is larger than At. 10 

When stereoscopic CG images or the like are dis- 
played on such a display, binocular parallax of displayed 
images varies greatly according to the image resolution, 
in some cases making it difficult for the view to achieve 
stereoscopic vision and thus tending to increase eye 15 
strain. 

Currently, there are three types of broadcast video 
signals, HDTV, EDTV, and NTSC. These signal formats 
differ not only in resolution but also in screen aspect 
ratio, and hence, there arise differences in display size. 20 
Furthermore, in some display methods, the size can be 
changed as in a windowing environment. Accordingly, 
binocular parallax of displayed images varies greatly, in 
some cases making it difficult for the view to achieve 
stereoscopic vision and tending to increase eye strain. 25 

The present invention is also intended to resolve 
the above-outlined problems involved in stereoscopic 
presentation of natural images, and it is also an object 
of the invention to make it possible to produce easy-to- 
view, natural-looking stereoscopic images by automati- 30 
cally adjusting the amount of binocular parallax accord- 
ing to the screen (window) size even when the same 
stereoscopic image signal is input. 

SUMMARY OF THE INVENTION 35 

According to the present invention, the fusional 
range computing means computes the binocular 
fusional range of the viewer viewing the screen of the 
stereoscopic image display device for displaying the 40 
stereoscopic image of the object, on the basis of the 
pre-entered parameters consisting at least of the size of 
the screen and the viewing distance between the 
screen and the viewer, and the camera parameter cal- 
culating means calculates the conditions for camera 45 
parameters, based on the binocular fusional range and 
on the object-to-camera distance generated by the dis- 
tance information extraction section, so that the object 
in its entirety can be brought within the viewer's binocu- 
lar fusional range; then, using the camera parameter so 
determining section, the CG operator determines the 
camera parameters based on the output of the camera 
parameter calculating means, and based on three- 
dimensional structural information describing a three- 
dimensional shape of an object, using the thus deter- 55 
mined camera parameters, the projection transforma- 
tion section generates the plurality of two-dimensional 
projection images as viewed from the plurality of cam- 
eras. 
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BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 is a diagram showing the configuration of a 
stereoscopic CG image generating apparatus 
according to a first embodiment of the present 
invention; 

Figure 2 is a diagram showing the relationship 
between an object and camera positions in a CG 
space (world coordinate system) according to the 
present invention; 

Figure 3 is a diagram showing a viewer space 
(defining the space where stereoscopic images are 
viewed) according to the present invention; 
Figure 4 is a diagram showing a stereoscopic 
image parallel shooting method according to the 
present invention; 

Figure 5(a) is a diagram showing an example of a 
display produced on a display section in an opera- 
tion section according to the first embodiment, and 
Figure 5(b) is a diagram showing an operation 
panel of the operation section; 
Figure 6 is a diagram showing the configuration of a 
stereoscopic CG image generating apparatus 
according to a second embodiment of the present 
invention; 

Figure 7 is a diagram showing the configuration of a 
stereoscopic CG image generating apparatus 
according to a third embodiment of the present 
invention; 

Figure 8(a) is a diagram showing the .concept of 
near clipping and far clipping (independently for left 
and right cameras), and Figure 8(b) is a diagram 
showing the concept of near clipping and far clip- 
ping (common to left and right cameras), according 
to the third embodiment; -.,. , ! . *= , 

Figure 9 is a diagram showing the configuration of a 
stereoscopic CG image generating itv apparatus 
according to a fourth embodiment of Me present 
invention; 

Figure 10 is a diagram showing the configuration of 
a stereoscopic CG image generating > apparatus 
according to the prior art; . \ : 

Figure 1 1 (a) is a diagram showing an example pf a 
geometric model for explaining three-dimensional 
structural information, and Figure 1 1(b) , is. a dia- 
gram showing data structure of the geometric 
model; •<•,-,, ~, 

Figure 12(a) is a diagram showing a world coordi- 
. nate system and projection transformation, and; Fig- 
ure 1 2(b) is a diagram showing camera parameters; 
Figure 13 is a diagram showing the configuration of 
a stereoscopic CG image generating apparatus 
according to a fifth embodiment of the present 
invention; .......... 

Figure 1 4 is a diagram showing the configuration of 
a stereoscopic TV apparatus according to; a sixth 
embodiment of the present invention; : v - v r . ): .-' 
Figure 15 is a diagram showing the operation of a 
parallax calculation section according , ta, the 



5 



EP 0 751 689 A2 



6 



present invention; 

Figure 16 is a diagram showing the configuration of 
a stereoscopic TV apparatus according to a sev- 
enth embodiment of the present invention; 
Figure 17 is a diagram showing a time-multiplexed 
stereoscopic image signal according to the seventh 
embodiment of the present invention; 
Figure 1 8 is a diagram showing the configuration of 
a stereoscopic TV apparatus according to the prior 
art; 

Figure 19 is a diagram showing relationships 
between binocular parallax and display image size 
and image resolution; 

Figure 20 is a diagram showing a viewer's binocular 
fusional range; 

Figure 21 is a diagram showing the configuration of 
a stereoscopic TV apparatus according to an eighth 
embodiment of the present invention; and 
Figure 22 is a diagram showing the relationship 
between viewing angle of display screen and binoc- 
ular fusional limits. 

DESCRIPTION OF THE PREFERRED EMBODI- 
MENTS 

(Embodiment 1) 

Figure 1 is a diagram showing the configuration of a 
stereoscopic CG image generating apparatus accord- 
ing to a first embodiment of the present invention. In 
Figure 1, reference numerals 1 and 2 are projection 
transformation sections and 3 and 4 are rendering sec- 
tions; these sections are the same as those used in the 
prior art stereoscopic CG generating apparatus. The 
present embodiment differs from the prior art stereo- 
scopic CG image generating apparatus in that a dis- 
tance information extraction section 5, a fusional range 
verification section 11 , a camera parameter determining 
section 6, and an operation section 12 are added. The 
fusional range verification section 1 1 includes a fusional 
range calculating means and a camera parameter cal- 
culating means. 

The operation of the stereoscopic CG image gener- 
ating apparatus of the present embodiment will be 
described below. First, three-dimensional structural 
information, describing a three-dimensional shape of an 
object by a surface model, is input to the projection 
transformation sections 1 and 2 as well as to the dis- 
tance information extraction section 5. While checking 
the output images produced on a stereoscopic image 
display device (not shown) connected to the rendering 
sections 3 and 4, a CG operator arranges the object and 
an imaginary camera (at midpoint between left and right 
cameras) at appropriate positions in the world coordi- 
nate system as he desires, thus determining its direc- 
tion. The left and right cameras are arranged at 
positions of -Wc and +Wc, respectively, along the x-axis 
with the imaginary camera position V at its origin (see 
Figure 2). It is assumed hers that the camera parame- 



ters at this time (the camera spacing Wc, the focal 
length f, and the distance dx to the converging point to 
be described later with reference to Figure 3) are preset 
as initial values. (The camera spacing Wc used here 
5 refers to half the distance between the left and right 
cameras. The same applies hereinafter unless other- 
wise stated.) 

Next, the distance information extraction section 5 
extracts from the object a point nearest to the imaginary 

10 camera (near point N) and a point farthest from the 
imaginary camera (far point F). The x, y coordinates of 
these points are calculated, and defined as N(XN, YN, 
ZN) and F(XF, YF, ZF), respectively (see Figure 2). If 
these two points both fall within the binocular fusional 

15 range of the viewer, a good stereoscopic CG image is 
obtained. In this case, the far point and near point may 
be determined comprehensively by calculating the aver- 
age of the distances from the imaginary camera and left 
and right cameras, etc. 

20 Based on the three-dimensional coordinates of the 
near point N and far point F, and on the viewer's viewing 
distance ds and the screen size M of the stereoscopic 
image display device on which stereoscopic CG images 
are displayed for viewing (ds and M are parameters 

25 entered in advance), the fusional range verification sec- 
tion 1 1 calculates an effective range (where the viewer 
can achieve binocular fusing) of the camera parameters 
(camera spacing Wc, camera focal length f. and dis- 
tance dx from camera converging point to imaginary 

30 camera position V). Viewer space parameters are 
defined as shown in Figure 3. 

Mathematical expressions for the calculations are 
give below. 

35 Near point condition: [Mathematical 1] 



n" MfW c ^ 
2 • d s • tang- < - -g-S {®} +2AS 



40 



45 



50 



55 



Far point condition: [Mathematical 2] 

D + MfW c _ 
2 - d s • tang- > - ~^{®} +2AS 

/ MfW c _ \ 

^or2W e >- — {®}+2ASJ 



where 



® = 



^(x N+ W c ) + Y N ^(X N -W C ) + Y N 

Y N+ (x N+ W c )^ Y N -(x N -W c )^ 
°x ° X 
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^<x F+ W c )+Y F ^(X F -W C ) + Y F 



W, 



w w 

Y F+ (x F+ W c )^ Y F -(x F -W c )^ 



where 2 x AS indicates the phase difference between 
left and right images on the stereoscopic image display 
screen; usually, 2 x AS is set equal to the viewer's inter- 
pupillary distance (about 60 mm). Further, D- and D+ 10 
represent binocular parallaxes at the nearest point and 
the farthest point, respectively, within the range where 
the viewer can achieve binocular fusion. 

The focal length f and the field of view, 0, of a cam- 
era have a unique relationship with each other as 15 
expressed by [Mathematical 3]; 



[Mathematical 3 ] 



tan§ = J 

2 2 < 



20 



therefore, either may be used to define the parameter. 
Also, dx can be automatically determined by the camera 
position and the three-dimensional position of a point to 25 
which the camera is directed. The fusional range verifi- 
cation section 1 1 calculates every possible combination 
of the camera spacing Wc, camera focal length f, and 
distance dx from camera converging point P to imagi- 
nary camera position V, that satisfies both of the above 30 
expressions. 

In Mathematical 1, D+ and D- indicate the limit val- 
ues inside which the viewer can achieve binocular 
fusion. These values depend on the size of the image 
display screen presented to the viewer. The fusional 35 
range verification section 1 1 stores in advance the val- 
ues of the binocular fusional range corresponding to the 
image display screen, and based on them, evaluates 
the viewer's fusional range. 

Next, the camera parameter determining section 6 40 
determines which or the camera parameter combina- 
tions calculated by the fusional range verification sec- 
tion 11 is to be used. 

For example, one of the following methods is used. 

45 

(1) While checking the output images by operating 
the operation section 12, the CG operator tries var- 
ious camera parameter combinations calculated by 
the fusional range verification section 11 and 
selects one that he thinks gives the best result. so 

(2) The CG operator first determines one of the 
camera parameters, Wc, f, and dx, then, changes 
the remaining two parameters, by operating the 
operation section 12. arbitrarily within parameter 
combinations presented from the fusional range ss 

• verification section 11 (combinations of the two 
parameters that satisfy the expressions of Mathe- 
matical 1 and Mathematical 2), and while checking 
the output images, determines the combination that 



he thinks give the best result. 
(3) The CG operator first determines two of the 
camera parameters, Wc, f, and dx, then, changes 
the remaining one parameter, by operating the . 
operation section 12, arbitrarily within the parame- 
ter range presented from the fusional range verifi- 
cation section 11 (the range of the one parameter 
that satisfies the expressions of Mathematical 1 
and Mathematical 2), and while checking the output 
images, determines one that he thinks gives the 
best result. 

The methods of (1) to (3) will be described in further 
detail below. 

In the case of (1), a region (effective region) defin- 
ing combinations of parameters, Wc, f, and dx, where 
the viewer can achieve binocular fusion, is displayed, 
along with a pointer 13 indicating the current combina- 
tion of Wc, f, and dx, on a display section arranged on 
the operation section 12, as shown in Figure 5(a). 

The CG operator changes the position of the 
pointer by using a three-dimensional mouse or the like. 
At this time, the values of Wc, f, and dx change as the 
pointer position changes, but the pointer cannot be 
moved outside the effective region. The parameters at 
the coordinate position pointed to by the pointer are out- 
put to the camera parameter determining section 6. and 
stereoscopic images to be output are calculated by the 
projection transformation sections 1 and 2 and the ren- 
dering sections 3 and 4. By viewing the image produced 
on the stereoscopic image display device, the iCG; oper- 
ator adjusts the position of the pointer as he desires. In 
this way, control is performed so that the output stereo- 
scopic CG images are always produced within the 
viewer's binocular fusional range. 

In the case of (2) and (3), as shown in Figure 5(b), 
a control panel 12a is provided which comprises three 
volume controls 14, 15, and 16 for adjusting the respec- 
tive parameters, and three lock buttons 17, 18, and 19 
for locking the respective parameters. It is assumed 
here that initially the lock buttons are not pressed ON. 
While viewing the output stereoscopic CG - image, first 
the CG operator selects, for example, the focal length f 
out of the camera parameters and determines to set it to 
fO on the operation panel 12a (Figure 5(b)) by consider- 
ing the field of view. The operator then sets the volume 
control 14 to fO and presses the lock button 17. The 
parameter f is thus locked to fO. When the parameterf is 
locked, the fusional range verification section ;1 1 calcu- 
lates combinations of the remaining parameters Wc and 
dx which satisfy both Mathematical 1 and Mathematical 

Next, the CG operator changes the parameters ,Wc 
and dx by operating the volume controls 15 and 16 while 
checking the output images. Here, provisions are-made 
so that the parameters Wc and dx that the CG operator 
is going to set can be changed only within the ranges of 
Wc and dx values that satisfy both Mathematical 1 and 
Mathematical 2. At this time, one or the other; of the two 
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parameters, Wc or dx, can be locked by the lock button 
18 or 19. Then, only the remaining one parameter is 
changed while checking the output stereoscopic CG 
images. In this way, the parameters can be determined 
one by one while constantly keeping the output stereo- 
scopic CG image within the viewer's binocular fusional 
range. 

Using the camera parameters Wc, f, and dx deter- 
mined in the above manner, the projection transforma- 
tion sections 1 and 2 calculate the two-dimensional 
positions of the object that would be projected on films 
when photographed by the right and left cameras, 
respectively, and the rendering sections 3 and 4 deter- 
mine the brightness and color of an image within each 
polygon on the basis of the material of the object, the 
type of the light source used, and the three-dimensional 
positions. Finally, stereoscopic CG images for the left 
and right eyes are output. 

The present embodiment has been described by 
assuming the camera arrangement for converging 
shooting (in Figure 2, the left and right cameras 7 and 8 
are arranged both pointing in the direction of point P). 
Alternatively, the left and right cameras may be 
arranged in parallel to each other as shown in Figure 4. 
In this case, the fusional range verification section 1 1 
need not use the three-dimensional coordinate values 
of the far point of the object, but need only calculate 
combinations of Wc and f that satisfy the condition 
expressed by Mathematical 4. 

[Mathematical 4] 

W c<-W (d s -tan^-AS) 

(This setting is equivalent to setting dx at °°.) 

In the present embodiment, the limits of the 
viewer's binocular fusional range are given by Mathe- 
matical 1 and Mathematical 2. Alternatively, D- and D+ 
in these expressions or depthwise distances corre- 
sponding to these parameters may be entered manually 
by the CG operator. 

In the present embodiment, the camera parameters 
are determined based on one CG image data, but in the 
case of moving images also, the camera parameters 
can be determined in like manner by using CG image 
data at each of successive time instants. Furthermore, if 
a camera parameter sequence over a certain period is 
determined and stored in advance, it is possible to play 
back the same scene any number of times by using the 
stereoscopic camera parameters having the same pat- 
tern of change 

As described above, according to the present 
embodiment, the camera-to-object distance information 
and the magnitude of parallax of generated stereo- 
scopic CG images on the display device are calculated 
from the size of the display device and the viewing dis- 
tance, and by checking whether the CG images fall 
within the viewer's binocular fusional range, proper 



camera parameters (focal length or field of view, camera 
spacing, and converging point) are determined. In this 
manner, easy-to-view stereoscopic CG images can be 
obtained automatically. 

5 

(Embodiment 2) 

Figure 6 is a diagram showing the configuration of a 
stereoscopic CG image generating apparatus accord- 
io ing to a second embodiment of the present invention. In 
Figure 6, reference numerals 1 and 2 are projection 
transformation sections, 3 and 4 are rendering sections, 
and 6 is a camera parameter determining section; these 
sections are the same as those used in the stereoscopic 
15 CG generating apparatus of the first embodiment. The 
present embodiment differs from the stereoscopic CG 
image generating apparatus of the first embodiment in 
that the distance information extraction section 5 and 
fusional range verification section 11 in Figure 1 are 
20 replaced by a parallax map calculation section 20 and a 
fusional region judging section A 21 which acts as a 
pixel count calculating means. 

The operation of the stereoscopic CG image gener- 
ating apparatus having the above configuration will be 
25 described below. 

The present embodiment is particularly effective in 
cases where, of the camera parameters Wc. f. and dx, 
at least one parameter, specifically Wc, is fixed by the 
CG operator and, when output without any adjustment, 
30 the entire stereoscopic CG images cannot be brought 
within the viewer's binocular fusional range. 

Three-dimensional structural information, describ- 
ing a three-dimensional shape of an object by a surface 
model, is input to the projection transformation sections 
35 1 and 2. As in the first embodiment, the CG operator, 
while checking the output images produced on the ster- 
eoscopic image display device (not shown) connected 
to the rendering sections 3 and 4, arranges the object 
and the imaginary camera (at midpoint between left and 
40 right cameras) at appropriate positions in the world 
coordinate system as he desires, thus determining its 
direction. The left and right cameras are arranged at 
positions of -Wc and +Wc, respectively, along the x-axis 
with the imaginary camera "position V at its origin (see 
45 Figure 2). The camera parameters Wc, f, and dx used 
here are preset as initial values (at least one of these 
parameters is fixed). 

Using these preset parameters, the projection 
transformation sections 1 and 2 convert the three- 
50 dimensional structural information into images pro- 
jected on a two-dimensional screen, and the resulting 
images are fed to the rendering sections 3 and 4 which 
then generate CG images. 
- From the outputs of the rendering sections 3 and 4 
55 and the three-dimensional structural information, the 
parallax map calculation section 20 calculates the depth 
data of the left and right images at each point of the pro- 
jection-converted images, that is, a parallax map (an 
image showing the amount of depth at each pixel). For 
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example, by using results of Z buffer processing, a pop- 
ular technique used in CG, it is possible to obtain the 
amount of depth at each point on the screen, and it is 
easy to construct a parallax map using this technique. In 
the case of images such as wireframes that do not 
involve rendering, a parallax map is constructed using 
the outputs of the projection transformation sections \ 
and 2 and the three-dimensional structural information. 

Based on the parallax map, the fusional region 
judging section A 21 calculates the number of pixels 
(this is defined as the effective pixel count), or the 
number of vertices of polygons, or the number of center- 
points of polygons, that are contained in a region on the 
screen that lies within the binocular fusional range of the 
viewer viewing the stereoscopic CG images (the 
fusional range is a range where the parallax takes a 
value between D- and D+, these values being depend- 
ent on the screen size, and a database storing these 
values is included in the fusional region judging section 
A 21). 

Next, while successively changing the camera 
parameters Wc, f. and dx, the fusional region judging 
section A 21 calculates, based on the output of the par- 
allax map calculation section 20, the effective pixel 
count for every possible combination of Wc, f, and dx 
within preset variation ranges, excluding, however, the 
parameter whose value is fixed. 

Then, the camera parameter determining section 6 
computes the parameters Wc, f, and dx that provide the 
largest effective pixel count of all the combinations of 
the parameters Wc, f, and dx for which the effective pixel 
count has been calculated. The thus computed param- 
eters, Wc, f, and dx, are supplied to the projection trans- 
formation sections 1 and 2. 

At this time, rather than selecting the maximum 
value of the effective pixel count, a number of combina- 
tions that provide the effective pixel count close to the 
maximum value may be presented for selection by the 
CG operator, and the selected combination may be sup- 
plied to the projection transformation sections 1 and 2. 

Furthermore, of the three parameters, one or more 
parameters may be fixed, and the camera parameter 
determining section 6 may be made to present the com- 
bination of the remaining parameters that provides the 
largest effective pixel count, or to present a number of 
combinations of the remaining parameters that provide 
the effective pixel count close to the maximum value, for 
selection by the CG operator. 

Using the parameters thus supplied, the projection 
transformation sections 1 and 2 and the rendering sec- 
tions 3 and 4 compute final stereoscopic CG images. In 
this way, the camera parameters can be automatically 
determined to maximize the image portion that falls 
within the viewer's binocular fusional range. 

If the effective pixel count has a plurality of maxi- 
mum values, stereoscopic CG images are generated 
using the parameters for the respective cases, and the 
CG operator selects the desired combination of the 
parameters by checking the results on the stereoscopic 



image display apparatus. • 

As described, according to the present embodi- 
ment, even in cases where there are limitations on the 
camera parameters and the entire stereoscopic CG 

5 images produced for final output cannot be brought 
within the viewer's binocular fusional range, the camera 
parameters Wc, f, and dx can be automatically deter- 
mined to maximize the image area that falls within the 
binocular fusional range. 

w In the second embodiment described above, the 
near point and far point of the object may be computed 
from the parallax map, and thereafter, based on these 
results, the stereoscopic camera parameters may be 
determined using the same method as described in the 

15 first embodiment. 

(Embodiment 3) 

Figure 7 is a diagram showing the configuration of a 

20 stereoscopic CG image generating apparatus accord- 
ing to a third embodiment of the present invention. In 
Figure 7, reference numerals 1 and 2 are projection 
transformation sections, 3 and 4 are rendering sections, 
6 is a camera parameter determining section, and 20 is 

25 a parallax map calculation section; these sections are 
the same as those used in the stereoscopic CG image 
generating apparatus of the second embodiment. 

The differences from the stereoscopic lCG image 
generating apparatus of the second embodiment are 

30 that the fusional region judging section A 21 is replaced 
by a fusional region judging section B 21' .as a pixel 
count calculating means, and that a clipping value 
determining section 22 as a specific image processing 
section is added. 

35 The operation of the stereoscopic CG image gener- 
ating apparatus having the above configuration will be 
described below. First, the camera parameter determin- 
ing section 6 determines the camera parameters (Wc, 
dx, f) to supply to the projection transformation sections 

40 1 and 2 in the same manner as described in the forego- 
ing second embodiment. 

While checking the output images produced on the 
stereoscopic image display device connected to the 
rendering sections 3 and 4, the CG operator arranges 

45 the object and the imaginary camera at , appropriate 
positions in the world coordinate system asi he. desires, 
thus determining its direction. ;i -«. :i t •:' 

Using the thus set parameters, the projection trans- 
formation sections 1 and 2 convert the three-dimen- 

50 sional structural information into images projected on a 
two-dimensional screen, and the resulting . images are 
fed to the rendering sections 3 and 4 which then gener- 
ate CG images. :~ o ;, 

From the outputs of the rendering sections, 3 and 4 

55 and the three-dimensional structural information,.- the 
parallax map calculation section 20 calculates a paral- 
lax map at each point of the projection-converted 
images. : ,-. j 

Based on this parallax map, the fusional region 
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judging section B 21 ' calculates the effective pixel count 
of the region on the screen that lies within the binocular 
fusional range of the viewer viewing the stereoscopic 
CG images, and while successively changing the cam- 
era parameters Wc, f, and dx, calculates the effective 5 
pixel count on the basis of the output of the parallax 
map calculation section 20. 

The fusional region judging section B 2V has a 
database defining relationships between screen size 
and fusional range, and calculates the effective pixel 10 
count by referencing this database. 

Next, the camera parameter determining section 6 
computes the parameters Wc, f, and dx that provide the 
largest effective pixel count of all the combinations of 
the parameters Wc, f , and dx for which the effective pixel 15 
count has been calculated. The thus computed param- 
eters, Wc, f, and dx, are supplied to the projection trans- 
formation sections 1 and 2. 

Using the parameters thus supplied, the projection 
transformation sections 1 and 2 and the rendering sec- 20 
tions 3 and 4 compute final stereoscopic CG images. 
Once the camera parameters have been determined, 
their values are fixed. 

Here, consider the situation where the object is 
moved or the left and right cameras are moved while 25 
maintaining their positional relationship. If the cameras 
are moved toward the object, the distance to the object 
decreases, and the binocular parallax increases, even- 
tually exceeding the viewer's binocular fusional range. 
The same applies for the far point. To address this prob- 30 
lem, clipping is applied while holding the camera param- 
eters fixed. 

In conventional CG image processing, the render- 
ing sections 3 and 4 would apply clipping to a near 
object and a far object so that they would not be dis- 35 
played. In the present embodiment, on the other hand, 
values defining such clipping positions are determined 
for the rendering sections 3 and 4, as shown in Figure 
8(a), so that images outside the binocular fusional 
range will not be output. 40 

That is, the fusional region judging section B 21' 
calculates the limits (near limit and far limit) of the 
viewer s binocular fusional range. More specifically, in 
the world coordinate system of Figure 8(a), all points 
that satisfy Mathematical 1 and Mathematical 2 are 45 
computed. In Figure 8(a), the region consisting of such 
points is defined as the shaded region. 

Next, a near clipping value, CLN, and a far clipping 
value, CLF, are determined so that those points lying 
outside the shaded region will not be included in the sc 
final CG images output for display. (CLNR and CLNL 
are near clipping planes for the right camera and left 
camera, respectively, and CLFR and CLFL are far clip- 
ping planes for the right camera and left camera, 
respectively.) » 

Only objects lying within the region bounded by the 
near clipping planes and far clipping planes are output 
from the rendering section 3 and 4. 

In the above example, the clipping planes, CLNR, 



CLNL, CLFR, and CLFL, are set for the right and left 
cameras, respectively, but alternatively, a near clipping 
plane CLCN and a far clipping plane CLCF may be 
determined with respect to the imaginary camera (ori- 
gin), as shown in Figure 8(b) and these may be used 
common to the right and left cameras. 

In the present embodiment, if there is an object 
lying in a region to be clipped away, settings are made 
so that such an object will not be included in the final 
images output for display. Alternatively, provisions may 
be made to gradually lower the contrast of the object or 
decrease the color intensity of the object as the object 
approaches a region to be clipped away. In this case, 
since the object vanishes in a natural manner as it 
enters a region outside the viewer's binocular fusional 
range, unnaturalness can be greatly reduced in the ster- 
eoscopic CG images output for display. 

As described above, according to the present 
embodiment, even when the camera parameters are 
fixed, by setting suitable clipping planes considering the 
viewer's binocular fusional range the final stereoscopic 
CG images output for display can be brought within the 
viewer's binocular fusional range. 

(Embodiment 4) 

Figure 9 is a diagram showing the configuration of a 
stereoscopic CG image generating apparatus accord- 
ing to a fourth embodiment of the present invention. In 
Figure 9, reference numerals 1 and 2 are projection 
transformation sections, 3 and 4 are rendering sections, 
6 is a camera parameter determining section, 20 is a 
parallax map calculation section, and 21' is a fusional 
region judging section B; these sections are the same 
as those used in the stereoscopic CG image generating 
apparatus of the third embodiment. 

The difference from the stereoscopic CG image 
generating apparatus of the third embodiment is that the 
clipping value determining section 22 is replaced by a 
focus parameter determining section 23 and a fog effect 
parameter determining section 24, these parameters 
being controlled in accordance with the amount of bin- 
ocular parallax of the object concerned. The focus 
parameter determining section 23 and the fog effect 
parameter determining section 24 together constitute a 
specific image processing section. 

The operation of the stereoscopic CG image gener- 
ating apparatus having the above configuration will be 
described below. First, the camera parameter determin- 
ing section 6 determines the camera parameters (Wc, 
dx, f) for the projection transformation sections 1 and 2 
in the same manner as in the foregoing third embodi- 
ment. 

While checking the output images produced on the 
stereoscopic image display device connected to the 
rendering sections 3 and 4, the CG operator arranges 
the object and the imaginary camera at appropriate 
positions in the world coordinate system as he desires, 
thus determining its direction. 
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After that, the projection transformation sections 1 
and 2 convert the three-dimensional structural informa- 
tion into images projected on a two-dimensional screen, 
and the resulting images are fed to the rendering sec- 
tions 3 and 4 which then generate CG images. 

Next, from the outputs of the rendering sections 3 
and 4 and the three-dimensional structural information, 
the parallax map calculation section 20 calculates a par- 
allax map at each point of the projection-converted 
images. Based on the parallax map, the fusional region 
judging section B 21' calculates the effective pixel count 
of the region on the screen that lies within the binocular 
fusional range of the viewer viewing the stereoscopic 
CG images, and while successively changing the cam- 
era parameters Wc, f, and dx, calculates the effective 
pixel count on the basis of the output of the parallax 
map calculation section 20. The fusional region judging 
section B 21' has a database defining relationships 
between screen size and fusional range, and calculates 
the effective pixel count by referencing this database. 

Next, the camera parameter determining section 6 
computes the parameters Wc, f, and dx that provides 
the largest effective pixel count of all the combinations 
of the parameters Wc, f, and dx for which the effective 
pixel count has been calculated. The thus computed 
parameters, Wc, f, and dx. are supplied to the projection 
transformation sections 1 and 2. 

Using the parameters thus supplied, the projection 
transformation sections 1 and 2 and the rendering sec- 
tions 3 and 4 compute final stereoscopic CG images. 
Once the camera parameters have been determined, 
their values are fixed. 

In some cases, the rendering sections 3 and 4 intro- 
duce deliberate distortions, such as defocusing or fog- 
ging distant objects, to express the perspective effect 
when generating final CG images. 

The focus parameter determining section 23 and 
the fog effect parameter determining section 24 deter- 
mine the degrees of defocusing and fogging on the 
basis of the parallax map and the viewer's binocular 
fusional range. 

For example, based on the output of the fusional 
region judging section B 21 ', the focus parameter deter- 
mining section 23 calculates those regions in the world 
coordinate system where the viewer cannot achieve 
binocular fusion. More specifically, three-dimensional 
coordinate positions that do not satisfy either Mathe- 
matical 1 or Mathematical 2 or both are calculated. 

When rendering objects lying within such regions in 
the rendering sections 3 and 4, the focus parameter 
determining section 23 outputs such a focus parameter 
as to give the output image a defocused and unclear 
appearance. 

If this effect is applied gradually increasingly as the 
image nears a limit of the viewer's binocular fusional 
range, a more natural defocusing effect can be given to 
the image. 

To achieve the defocusing effect, a camera out-of- 
focus condition may be simulated using traditional CG 



techniques such as ray tracing, or spatial filtering (e.g., 
low-pass filtering) may be applied to the generated CG 
images. There is also a technique in which, while suc- 
cessively changing the position of the object by small 

5 amounts according to the degree of defocusing, the 
same object is written a number of times to the same 
image memory, thereby blurring the edges. If the move- 
ment in changing the position of the object is made pro- 
portional to the distance from the camera , focused 

w plane, an out-of -focus effect can be achieved. (In the 
present embodiment, the movement should be made 
proportional to the distance from a limit of the viewer's 
binocular fusional range.) 

In this way, defocusing is applied to objects for 

15 which the viewer cannot achieve binocular fusion. This 
has the effect of reducing the unnaturalness arising 
when binocular fusion cannot be achieved. 

Similarly, based on the output of the fusional region 
judging section B 21', the fog effect parameter determin- 

20 ing section 24 calculates those regions in the world 
coordinate system where the viewer cannot achieve 
binocular fusion (especially, such regions where the far 
point condition expressed by Mathematical 2 does not 
hold). The fog effect parameter determining section 24 

25 controls the fog effect parameter so that an, effect is 
given that makes these regions appear as if shrouded in 
fog when the rendering sections 3 and 4 render objects 
lying in these regions. .'■■■i «*.. 

If the fog is made to become thicker as the image 

30 nears a limit of the viewer's binocular fusional range, the 
scene described in the CG images can be made to look 
more natural with distant regions appearing as jf hidden 
behind the fog. ,ii i 

In this way, by applying the fog effect when the bin- 

35 ocular parallax is so large that binocular fusion cannot 
be achieved, as in the case of distant objects, the unnat- 
ural feel that the viewer may have due to an inability to 
achieve binocular fusion can be alleviated, u p-m 

In a specific method of producing a fog effect in ren- 

40 dering objects, a fog coefficient f (0.0 to 1.0) that 
decreases with increasing distance, for exampje, is con- 
sidered. Here, f = 1 means no fog, and when f = 0, the 
image appears completely washed out. H \ r . >. ^ 
The degree of this effect can be defined by Mathe- 

45 matical 5, Mathematical 6, etc., where z denotes the 
distance from the camera ,. i. o 

[Mathematical 5] ■;, ■ iv. - 

: r o ":;i r " 

so f = (far- Z)/ far-near • »' • 

[Mathematical 6] ; y ;v* 

f = exp(-density x z) n o. u 

55 •;< : ; : !'-t' 

Here, far and near respectively indicate the farthest 
point and the nearest point from the camera; in. the gen- 
erated CG image, and density means the density of the 
fog. Rendering color is calculated by Mathematical 7. 
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[Mathematical 7] 

C = fxC G +(1-f)xC f 

Here, Co is the color of the rendered object, and Cf 
is the color of the fog. The fog effect parameter deter- 
mining section 24 sets the coefficient f = 1 when the 
image is inside the viewer's binocular fusional range, 
and smoothly changes f down to 0 as the image nears a 
limit of the binocular fusional range and exceeds the 
limit. 

In this way, the rendering sections 3 and 4 generate 
images such that distant objects outside the binocular 
fusional range appear as if shrouded in fog, thereby 
reducing the unnaturalness arising when the binocular 
fusional range is exceeded. 

As described above, according to the present 
embodiment, if objects are displayed having such binoc- 
ular parallax that binocular fusion cannot be achieved, 
since fogging is applied to distant objects and defocus- 
ing is applied to near and distant objects, the ill effect is 
reduced and easy-to-view stereoscopic CG images can 
be generated. 

(Embodiment 5) 

Figure 13 is a diagram showing the configuration of 
a stereoscopic CG image generating apparatus accord- 
ing to a fifth embodiment of the present invention. In 
Figure 13, reference numerals 1 and 2 are projection 
transformation sections, 3 and 4 are rendering sections, 
6 is a camera parameter determining section, 12 is an 
operation section, 11 is a fusional range verification 
section, 5 is a distance information extraction section 
and 127 is a CG image generating section; these sec- 
tions are the same as those used in the first embodi- 
ment. The difference from the first embodiment is the 
addition of the following sections: a window information 
management section 128, a window information man- 
agement control section 129, a mouse condition detec- 
tion section 130, a display screen size/dot count 
detection section 131, a window size detection section 

132, a window generation/deletion detection section 

133, a window display position detection section 134, a 
window focus change detection section 135, a video 
signal converting section 136, a stereoscopic display 
section 137, a mouse 138, a pair of glasses with liquid- 
crystal shutters 139, and a viewing distance measuring 
means 140. 

The operation of the stereoscopic CG image gener- 
ating apparatus having the above configuration will be 
described below. 

In the present embodiment, multiple kinds of stere- 
oscopic images are displayed simultaneously in differ- 
ent windows of different sizes on a computer screen in 
a windowing environment which has recently become a 
predominant operating environment. On the other hand, 
; in the first to fourth embodiments, the image display size 
was the screen size of the display device itself. 



It is assumed here that, as shown in the stereo- 
scopic display section 137 of Figure 13, there are differ- 
ent windows, A, B, and C, on the same screen, each 
showing a different stereoscopic image. 

5 Existing stereoscopic display techniques can be 

used to display stereoscopic images. In the present 
embodiment, the so-called time-multiplexing stereo- 
scopic image display technique is used in which stereo- 
scopic images converted by the video signal converting 

10 section 136 into video signals are input to the stereo- 
scopic display section 137 and the viewer views the 
stereoscopic images through the liquid-crystal shutter 
glasses 139. More specifically, the video signal convert- 
ing section 1 36 supplies the R and L video signals alter- 

75 nately, that is, first R, then L, then R, then L, and so on, 
in time multiplexing for display on the stereoscopic dis- 
play section 137; when the right-eye image is displayed, 
the right-eye glass of the liquid-crystal shutter glasses 
139 admits light and the left-eye glass blocks light, and 

20 when the left-eye image is displayed, the situation is 
reversed. In this way, the right-eye and left-eye images 
can be presented independently to the right eye and left 
eye of the viewer. Any other existing stereoscopic image 
display technique (such as using polarizers or lenticular 

25 lenses) may be employed. Usually, the viewer is allowed 
to resize the windows A, B, and C as he desires by 
using the mouse 138. 

When the display screen size of stereoscopic 
images changes as a result of a change in window size, 

30 the viewer's binocular fusional range also changes. Fig- 
ure 22 shows the relationship between the screen size 
(viewing angle) for displaying stereoscopic images and 
the maximum fusional parallax (expressed in angles, 
unit being [arc min]). It is shown that the allowable bin- 

35 ocular fusional range changes as the display screen 
size changes. A larger screen size provides a larger 
fusional range. Accordingly, when the window size is 
reduced while the window is displaying the same stere- 
oscopic image, the resulting parallax may exceed the 

40 binocular fusional range; therefore, the sizes of all the 
windows must be monitored constantly, and the camera 
parameters must always be determined accordingly. 
More specifically, information about the window oper- 
ated by the viewer using the mouse is detected by the 

45 window information management control section 129, 
and based on the detected information, the screen sizes 
of all the windows currently displayed are supplied to 
the fusional range verification section 11. In operation, 
the windows currently displayed are managed by the 
so window generation/deletion detection section 133, and 
the size of each individual window is determined by the 
window display position detection section 134. window 
size detection section 132, and display screen size/dot 
count detection section 131. More specifically, the size 
55 of each window actually displayed (in inches, centime- 
ters, etc.) is calculated from the display screen size (in 
inches), the horizontal and vertical dot counts of the dis- 
play (these can be computed by detecting synchroniza- 
tion frequencies), and the size of the window (dot 
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count), and the thus calculated window size is supplied 
to the fusional range verification section 1 1 . The screen 
size, for example, can be obtained by having the window 
information management section 128 control the video 
signal converting section 136 and the number of dots 5 
displayed on the stereoscopic display section 137. 

The remainder of the process is the same as that 
described in the first embodiment. That is, the distance 
information obtained from the three-dimensional struc- 
tural information is detected by the distance information 10 
extraction section 5, and using this distance information 
and the distance, ds, between viewer and display sur- 
face measured by the viewing distance measuring 
means 140, the camera parameters are calculated by 
Mathematical 1 and Mathematical 2. The camera 15 
parameters are supplied to the projection transforma- 
tion sections 1 and 2, and the right-eye and left-eye 
images, R and L, are calculated by the rendering sec- 
tions 3 and 4, respectively. This processing is performed 
separately for each of the stereoscopic display windows 20 
detected by the window information management sec- 
tion 128. 

As described above, in a display system having a 
windowing environment for displaying a plurality of ster- 
eoscopic images, the window information management 25 
section 1 28 supervises the size of each window, and the 
camera parameters are controlled, and hence the paral- 
lax is controlled, so that the stereoscopic image dis- 
played in each window comes within the viewer's 
binocular fusional range. In this way, easy-to-view, natu- 30 
ral-looking images can be presented. 

In the fifth embodiment, using the output of the win- 
dow focus change detection section 135, the camera 
parameters may be changed only for the window speci- 
fied by the viewer's mouse operation so that only the 35 
stereoscopic image displayed in the viewer's attention 
window is controlled within the binocular fusional range. 
In this way, the operational efficiency of the present 
invention can be enhanced. 

In any of the first to fourth embodiments, the view- 40 
ing distance between the viewer and the display screen 
may be measured using the viewing distance measur- 
ing means 140 shown in the fifth embodiment. 

As described so far, according to the present inven- 
tion, the distance information between the camera and 45 
object and the magnitude of parallax of generated ster- 
eoscopic CG images displayed on the display device 
are calculated from the display size and the viewing dis- 
tance, and based on which proper camera parameters 
(focal length or field of view, camera spacing, and con- so 
verging point) are determined. In this way, easy-to-view 
stereoscopic CG images can be obtained automatically. 

The first to fifth embodiments have been described 
using binocular stereoscopic images, but this is not 
restrictive. For multinocular stereoscopic images also, if 55 
the same techniques as described above are applied to 
determine the camera parameters for all pairs of images 
presented to the left and right eyes of the viewer, multi- 
nocular stereoscopic CG images can be generated eas- 



ily. 

In the first to fifth embodiments, the camera param- 
eters have been determined so as to bring the stereo- 
scopic CG images within the viewer's binocular fusional 
range for the entire screen generated. However, in the 
case of a scene that forces the viewer to focus his atten- 
tion on a particular object on the screen, fori example, 
other regions than the attention object may be set so 
that binocular fusion cannot be achieved for such 
regions. In such cases, the CG operator can easily set 
such regions on the output screen that need not be 
brought within the viewer's binocular fusionali range so 
that data from these regions are not used in determining 
the camera parameters. j 

In the first to fifth embodiments, stereoscopic 
images for the left and right eyes are obtained by CG, 
but any of the embodiments is also applicable for real 
images shot by a stereoscopic camera. In that case, the 
focal length f of the plurality of cameras, the camera 
spacing Wc, the camera-to-converging point distance 
dx (the distance from the point of intersection between 
the optic axes of the cameras to the centerpoint 
between the plurality of cameras) can be directly used 
as the camera parameters for the actual camera. In this 
case, however, the variable M in Mathematical rand 
Mathematical 2 is not the screen size, but ther ratio 
between the size of the light-receiving surface of the 
camera's imaging device and the size of .the; screen 
where stereoscopic images are actually displayed, i j 

In the fourth embodiment, both the focus parameter 
determining section 23 and the fog effect parameter 
determining section 24 have been provided,^ but only 
one or other of the two may be provided. ,1 > ' 

In any of the first to fifth embodiments, «the camera 
parameters have been determined so as to bring the 
stereoscopic CG images within the viewer's: binocular 
fusional range for the entire screen generated, but this 
is not restrictive. Rather, provisions may be made so 
that the CG operator can set such regions oh the output 
screen that need not be brought within the viewer s bin- 
ocular fusional range, and so that data .from;. these 
regions are not used in determining the camera param- 
eters. : y ' . 

In any of the first to fifth embodiments, processing 
sections, such as the distance information extraction 
section and the fusional range verification section, have 
each been implemented using dedicated hardware; but 
instead, the same functions may be implemented in 
software using a computer. -\ t \ c , r 

h : . r • 

(Embodiment 6) - 

Figure 14 is a diagram showing the configuration of 
a stereoscopic TV apparatus according to. . a, sixth 
embodiment of the present invention. In Figure 14, A1 
and A2 are CRTs, A3 and A4 are linear polarizers, A5 is 
a half-silvered mirror, A6 is a pair of glasses formed 
from polarizing filter, A7 is a viewer, A8 is a parallax cal- 
culation section, A9 is a resolution discrimination sec- 
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tion, A10 is an optimum parallax determining section, 
A1 1 is a basic synchronization timing generating sec- 
tion, A12a and A12b are synchronization sections, 
A13a and A13b are parallax control sections, A14a and 
A14b are RGB separation sections, A15a and A15b are 
CRT driving sections, and A16 is a viewing distance 
measuring section. 

The operation of the stereoscopic TV apparatus 
having the above configuration will be described below. 
First, a right-eye image signal is applied to the resolu- 
tion discrimination section A9, the synchronization sec- 
tion Al2a, and the parallax calculation section A8. 

The resolution discrimination section A9 detects 
the horizontal and vertical frequencies of the input 
image signal and discriminates the resolution of the 
input image. The basic synchronization timing generat- 
ing section A1 1 generates synchronization timing data 
matching the detected horizontal and vertical frequen- 
cies of the input image, and supplies the data to the syn- 
chronization sections A12a and A12b, which are thus 
synchronized to the input image signal and generate 
synchronization timing necessary for subsequent 
processing. 

From the right-eye and left-eye image signals, the 
parallax calculation section A8 calculates depth infor- 
mation (this is defined as a parallax map) at each point 
of the input image. A variety of methods are proposed 
for parallax map calculation. A block matching method 
that involves correlation computation will be described 
below. 

In Figure 15, consider left-eye and right-eye images 
each of NxM size. In the left-eye image, consider a 
block window of n x n pixels (3x3 pixels in the figure). 
The same image as shown in this block window is 
located in the right-eye image by using a window of the 
same size. At this time, the displacement between the 
left and right blocks is represented by a vector (Ax, Ay), 
whose horizontal component Ax indicates the binocular 
parallax of the left-eye and right-eye images at the 
center coordinates of the block windows. 

By horizontally shifting the block window position in 
the reference left-eye image in sequence across the 
entire screen, and by finding the corresponding block 
position (representing the binocular parallax) in the 
right-eye image for each shifted block position, a paral- 
lax map (showing depthwise distance at each position 
on the screen) can be obtained for the entire screen. 
The displacement between the left-eye and right-eye 
images at coordinates (x, y), that is, the binocular paral- 
lax (Ax, Ay), can be expressed as 

[Mathematical 8] 

Ax = i, for Min {Corr (i, j)} 

where 



[Mathematical 9] 

nxn 

Corr (i, j) = £ | GL (Xk, Yk) -GR (Xk-i, Yk-j) | 

k-1 

5 

In Mathematical 9, £ means taking the sum of the abso- 
lute values by varying the coordinates xk, yk within the 
block window of n x n. GR(xk, yk) and GL(xk, yk) repre- 
sent luminance values at coordinates (xk, yk) in the 
10 right-eye and left-eye images, respectively. 

In the binocular parallax Ax, Ay, the component that 
directly indicates the depthwise position is Ax. When the 
value of the binocular parallax is positive, the right-eye 
image is positioned to the right and the left-eye image to 
15 the left of the reference image, and the object lies 
behind the depthwise position where binocular parallax 
is 0; on the other hand, when the value of the binocular 
parallax is negative, this means that the object is posi- 
tioned in front of the depthwise position where binocular 
20 parallax is 0. 

From the parallax map obtained in the above man- 
ner, the parallax calculation section A8 outputs, for 
example, the largest value (the binocular parallax of the 
farthest object). Instead of simply extracting the maxi- 
25 mum value of the binocular parallax, spatial low-pass fil- 
tering may be applied, or a plurality of extraction regions 
may be preset and calculations may be made using a 
statistical technique. 

Next, the optimum parallax determining section 
30 A10 determines the amount of horizontal translation of 
the left-eye and right-eye images so that the viewer of 
the stereoscopic TV can fuse the displayed stereo- 
scopic images. This translation amount is determined 
on the basis of the output of the resolution discrimina- 
35 tion section A9 (the result obtained by judging the image 
resolution and aspect ratio based on the kind of input 
image signal detected), the image display size (in this 
case, CRT diagonal size expressed in inches), the out- 
put of the parallax calculation section A8 (the parallax 
40 map), and the distance between viewer and display sur- 
face measured by the viewing distance measuring sec- 
tion A1 6. .i" 

The optimum parallax determining section A10 has 
a database defining relationships between display 
45 screen size and viewer's fusional limits, and by refer- 
encing this database, determines the amount of hori- 
zontal translation so that the viewer can achieve 
binocular fusion. 

The method of determining this will be described in 
50 further detail. Denoting the largest binocular parallax 
output from the parallax calculation section A8 by A 
(dots), the horizontal dot count of the input image signal 
detected by the resolution discrimination section A9 by 
DH, the horizontal length of the display CRTs AT and A2 
55 by L, and the viewer's viewing distance measured by the 
viewing distance measuring section A16 by ds, the larg- 
est parallax Dm on the screen is given by [Mathematical 
10]. 
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[Mathematical 10] 

A . L . Dm 

The left-eye and right-eye images are translated hori- 
zontally so that Dm becomes almost equal to the 
viewer's binocular parallel condition or provides a 
smaller angle than this. For example, to make the maxi- 
mum value of the binocular parallax coincide with the 
viewer s binocular parallel condition, the amount of hor- 
izontal translation, Dc, is given by [Mathematical 11]. 

[Mathematical 11] 

Dc = Dm-We 

Here, We is the interpupillary distance of the viewer, 
which, in practice, is adjusted by shifting the left-eye and 
right-eye images in opposite directions horizontally by 
Dc/2. The amount of translation, Dc, may be adjusted, 
as necessary, based on the result derived from the 
above equation. 

Further, in Mathematical 10, if the smallest binocu- 
lar parallax (the largest parallax when displaying the 
object in the foreground on the screen) is set as A, then 
the optimum parallax determining section A10 deter- 
mines the amount of horizontal translation of the screen 
so that Dm becomes smaller than the viewer's largest 
fusional parallax (which varies depending on the screen 
size). 

Based on the thus obtained amount of horizontal 
translation, Dc, the parallax control sections A13a and 
A13b move the right-eye and left-eye images in oppo- 
site directions horizontally by Dc/2. Then, the image sig- 
nals are separated by the RGB separation sections 
A14a and A14b into the R, G, and B signals, which are 
supplied to the CRTs A1 and A2 via the CRT driving 
sections A15a and A15b. The images displayed on the 
CRTs A1 and A2 are linearly polarized by the respective 
polarizers A3 and A4 oriented at right angles to each 
other, and the polarized images are combined by the 
half-silvered mirror A5. By wearing the polarizing 
glasses A6 with their planes of linear polarization ori- 
ented in directions corresponding to the polarizers A3 
and A4, the viewer A7 can view the left-eye image with 
his left eye and the right-eye image with his right eye, 
thus achieving stereoscopic vision. 

As described above, according to the present 
embodiment, by discriminating the kind of input image 
signal and computing the size of the display screen, the 
viewer can always view natural-looking stereoscopic 
images displayed with Optimum binocular parallax. 

(Embodiment 7) 

Figure 16 is a diagram showing the configuration of 
a stereoscopic TV apparatus according to a seventh 
embodiment of the present invention. In Figure 16, A1 is 



a CRT, A18 is a pair of liquid-crystal shutter glasses, A7 
is a viewer, A8 is a parallax calculation section, A9 is a 
resolution discrimination section, A10 is an optimum 
parallax determining section, A 11 is a basic synchroni- 

5 zation timing generating section, A12 is a synchroniza- 
tion section, A13 is a parallax control section, A14 is an 
RGB separation section, A15 is a CRT driving section, 
A1 6 is a viewing distance measuring section, and A1 7 is 
a liquid-crystal shutter switching pulse generating sec- 

w tion. 

This configuration is an adaptation of the stereo- 
scopic TV apparatus of the sixth embodiment for use 
with a field sequential stereoscopic image signal. 

The operation of the stereoscopic TV apparatus 

15 having the above configuration will be described below. 
The basic operation is the same as the sixth embodi- 
ment, but since the left-eye and right-eye images are 
time-multiplexed on one stereoscopic image signal and 
are input alternately with each other, as shown in Figure 

20 1 7, the following processing becomes necessary. 

That is, the liquid-crystal shutter switching pulse 
generating section A1 7 outputs the liquid-crystal shutter 
control signal shown in Figure 17, in response to which 
the left-eye shutter in the liquid-crystal shutter glasses 

25 A1 8 is opened to admit light when the right-eye shutter 
is closed to block light, and vice versa. " . 

First, the right-eye image signal is input to the reso- 
lution discrimination section A9. the synchronization 
section A12, and the parallax calculation section A8. 

30 The resolution discrimination section A9 detects the 
horizontal and vertical frequencies of the input image 
signal and discriminates the resolution of the input 
image. The basic synchronization timing generating 
section A1 1 outputs synchronization timing data match- 

35 ing the detected horizontal and vertical frequencies of 
the input image, and the synchronization section A1 2 is 
synchronized to the timing of the image signal, i ( -i 

The parallax calculation section A8 calculates, the 
parallax map of the input image from the right-eye and 

40 left-eye image signals input alternately by time multi- 
plexing. The calculation of the parallax map can be 
made in exactly the same manner as in t the sixth 
embodiment. ; , >: 

Then, the parallax calculation section. A8 outputs, 

45 for example, the binocular parallax of the most distant 
object among the binocular parallaxes obtained at the 
respective points of the image. At this time,, in calculat- 
ing the binocular parallaxes, spatial low-pass filtering 
may be applied, or a plurality of extraction regions may 

so be preset and calculations may be made using, a statis- 
tical technique. , - \, ,\ 

Next, based on the output of the resolution discrim- 
ination section A9, the image display size, the output of 
the parallax calculation section A8, and the; distance 

55 between viewer and display surface, the optimum paral- 
lax determining section A10 determines the amount of 
horizontal translation of the left-eye and .rightreye 
images so that the stereoscopic images displayed can 
be fused with both eyes. .; , 
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The method of determining the translation amount 
is exactly the same as that described in the sixth 
embodiment. That is, denoting the largest binocular par- 
allax output from the parallax calculation section A8 by 
A (dots), the horizontal dot count of the input image sig- 
nal detected by the resolution discrimination section A9 
by DH, the horizontal length of the display CRT Al by L, 
and the viewer's viewing distance measured by the 
viewing distance measuring section A1 6 by ds, the larg- 
est parallax Dm on the screen is given by [Mathematical 
10]. To make the maximum value of the binocular paral- 
lax coincide with the viewer's binocular parallel condi- 
tion, the amount of horizontal translation, Dc, is given by 
[Mathematical 1 1]. However, the amount of translation, 
Dc, may be adjusted, as necessary, based on the result 
derived from this equation. 

Further, in Mathematical 10, if the smallest binocu- 
lar parallax (the largest parallax when displaying the 
object in the foreground on the screen) is set as A, then 
the optimum parallax determining section A10 deter- 
mines the amount of horizontal translation of the screen 
so that Dm becomes smaller than the viewer's largest 
fusional parallax (which varies depending on the screen 
size). 

Based on the thus obtained amount of horizontal 
translation, Dc, the parallax control section A13 move 
the right-eye and left-eye images in opposite directions 
horizontally by Dc/2. At this time, since the left-eye and 
right-eye signals are input as a time-multiplexed stereo- 
scopic image signal, the screen display is switched 
between the left-eye and right-eye images. Therefore, 
the amount of horizontal translation of the image is 
switched from +Dc/2 to -Dc/2 or vice versa between 
fields. 

The image signal is then separated by the RGB 
separation section A14 into the R, G, and B signals, 
which are supplied to the CRT A1 via the CRT driving 
section A15. The stereoscopic images displayed on the 
CRT Al, alternating between the left-eye and right-eye 
images, are presented independently to the respective 
eyes of the viewer wearing the liquid-crystal shutter 
glasses A18. 

As described above, according to the present 
embodiment, even when the input image signal is a 
time-multiplexed stereoscopic image signal, the viewer 
can always view natural-looking stereoscopic images 
displayed with optimum binocular parallax. 

(Embodiment 8) 

Figure 21 is a diagram showing the configuration of 
a stereoscopic TV apparatus according to an eighth 
embodiment of the present invention. In Figure 21 , A1 
and A2 are CRTs, A3 and A4 are linear polarizers, A5 is 
a half-silvered mirror, A6 is a pair of glasses formed 
from polarizing filter, A7 is a viewer, A8 is a parallax cal- 
culation section, A9 is a resolution discrimination sec- 
tion, A10 is an optimum parallax determining section, 
A1 1 is a basic synchronization timing generating sec- 



tion, A12a and A12b are synchronization sections, 
A13a and A13b are parallax control sections, A14a and 
A14b are RGB separation sections, A15a and A15b are 
CRT driving sections, and A16 is a viewing distance 

5 measuring section; these sections are the same as 
those used in the sixth embodiment. 

The difference from the sixth embodiment is the 
addition of the following sections: a window information 
management section A27, a window information man- 

10 agement control section A26, a mouse condition detec- 
tion section A25, a window size detection section A22, a 
window generation/deletion detection section A23, a 
window focus change detection section A24, and a 
mouse A28. 

75 The operation of the stereoscopic TV apparatus 
having the above configuration will be described below. 

In the sixth and seventh embodiments, the image 
display size was the screen size of the display appara- 
tus itself regardless of the synchronization frequency of 

20 the input video signal. On the other hand, in the present 
embodiment, multiple kinds of stereoscopic images are 
displayed simultaneously in different windows on a com- 
puter screen in a windowing environment which has 
recently become a predominant operating environment. 

25 And parallax is controlled in response to such condi- 
tions that viewer changes the size of the window by 
using a mouse. 

It is assumed here that a plurality of windows are 
displayed on the screen of each of the CRTs A1 and A2 

30 of Figure 21, and that stereoscopic images are dis- 
played in one of the windows. 

Usually, the viewer is allowed to resize each win- 
dow as he desires by using the mouse A28. When the 
size of the stereoscopic images changes as a result of 

35 a change in window size, not only the parallax of the dis- 
played stereoscopic images but the viewer's binocular 
fusional range also changes. Therefore, the window 
size must be monitored constantly, and the parallax 
must always be controlled accordingly More specifi- 

40 cally, information about the window operated by the 
viewer using the mouse is detected by the window infor- 
mation management control section A26. 

The window information management control sec^ 
tion A26 manages the currently displayed windows by 

45 the window generation/deletion detection section A23, 
and the size of each individual window is detected by 
the window size detection section A22. Data represent- 
ing the size of the applicable window is output to the 
optimum parallax determining section A10. 

so The optimum parallax determining section A10 
obtains the horizontal and vertical dot counts of the dis- 
play screen and the size of each window (dot count) 
from the outputs of the resolution discrimination section 
A9 and the window size detection section A22, and 

55 based on the thus obtained information and on the infor- 
mation about the size of the entire image display area 
(CRT diagonal size in inches), calculates the size of 
each window actually displayed fln inches, centimeters, 
etc.). 
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The remainder of the process is the same as that 
described in the sixth embodiment. That is, from the 
right-eye and left-eye image signals, the parallax calcu- 
lation section A8 calculates depth information at each 
point of the input image, and outputs a maximum or 
minimum value, for example. 

Next, . the optimum parallax determining section 
A10 obtains the actual size of the display window from 
the output of the resolution discrimination section A9 
(the result obtained by judging the image resolution and 
aspect ratio based on the kind of input image signal 
detected), the entire image display size (in this case, 
CRT diagonal size expressed in inches), and the display 
window size (dot count) output from the window infor- 
mation management section A27, and determines the 
amount of horizontal translation of the left-eye and right- 
eye images so that the viewer of the stereoscopic TV 
can fuse the displayed stereoscopic images. This trans- 
lation amount is determined, by using Mathematical 1 0 
and Mathematical 1 1 , on the basis of the output of the 
parallax calculation section A8 (the parallax map) and 
the distance between viewer and display surface meas- 
ured by the viewing distance measuring section A 16,. 

Based on the thus obtained amount of horizontal 
translation, the parallax control section A1a and A13b 
move the right-eye and left-eye images in opposite 
directions horizontally so that the displayed images will 
come within the viewer's binocular fusional range. Then, 
the image signals are separated by the RGB separation 
sections A14a and A14b into the R, G, and B signals, 
which are passed through the window information man- 
agement control section A26 and are output to the 
CRTs A1 and A2, via the CRT driving sections A15a 
and A15b, for display in the designated window on the 
display screen. 

When processing the input image signals for dis- 
play in a plurality of windows of different sizes, the 
amount of horizontal translation described above 
should be calculated independently for each window. 
Further, when there are different image signals for dis- 
play in different windows of respectively determined 
sizes, the processing should also be performed inde- 
pendently for each window. 

When the viewer has changed the size of a window 
by using the mouse A28, the window size detection sec- 
tion A22 detects a change in the window size, upon 
which the optimum parallax determining section A10 
calculates the amount of horizontal translation of the 
left-eye and right-eye images, and the result is immedi- 
ately reflected on the display screen. 

When displaying a plurality of stereoscopic images 
in a plurality of windows, provisions may be made so 
that the window specified by the user using the mouse 
is detected as the viewer's attention window by using 
the window focus change detection section A24, and 
the camera parameters are changed only for that win- 
dow, thus controlling only the stereoscopic image dis- 
played in the viewer's attention window within the 
binocular fusional range. In this way, the operational effi- 



ciency of the present invention can be enhanced. 

As described above, in a display system having a 
windowing environment where there occur changes in 
window size, the stereoscopic images displayed in each 

5 individual window can be controlled within the viewer's 
binocular fusional range by monitoring the window size 
• using the window information management section 
A27. | 

In the sixth, seventh, and eighth embodiments, the 

10 viewing distance was measured using the viewing dis- 
tance measuring section A16, but alternatively, a fixed 
value may be used, such as a recommended viewing 
distance obtained from the CRT size. 

In the sixth, seventh, and eighth embodiments, the 

75 viewing distance measuring section A16 may be con- 
structed to measure the viewing distances for a plurality 
of viewers and to output the average or weighted aver- 
age of the measured values or the maximum or mini- 
mum value thereof, thus performing parallax control 

20 considering the viewing distances of all the viewers 
involved. Further, in an environment where a plurality of 
viewers are viewing different windows, if binocular par- 
allax is controlled by independently setting a viewing 
distance for each window, optimum stereoscopic 

25 images can be presented to each individual viewer. 

In the sixth, seventh, and eighth embodiments, the 
optimum parallax determining section A10 calculated 
the largest parallax Dm on the screen by using the out- 
put of the parallax calculation section A8, the output of 

30 the resolution discrimination section A9, the horizontal 
length L of the display CRTs A1 and A2, and the 
viewer's viewing distance ds measured by the viewing 
distance measuring section A16. Depending on the Kind 
of input image signal, however, the produced display 

35 may not use the entire area of the CRT iscreen. To 
address this, the resolution discrimination section A9 
may be provided with a database defining relationships 
between the kind of input image signal (HDTV, NTSC, 
EDTV, computer-generated images, etc.) and. display 

40 screen size, and may be constructed to be able .^cor- 
rectly recognize the magnitude of displayed, binocular 
parallax according to the kind of input image signal. 

In the sixth, seventh, and eighth embodiments, the 
parallax calculation section A8 was described as out- 

45 putting the maximum value of the binocular parallax, but 
instead, the minimum value may be used so that the 
parallax of the nearest object appearing floating above 
the screen can be brought within the viewer S binocular 
fusional range. In this case, however, since the magni- 

50 tude of the allowable binocular parallax changes as a 
function of the viewing distance and screen size, ,a data- 
base defining the values of allowable binocular, ; paral- 
laxes must be provided. ;'.;.< 
The sixth, seventh, and eighth embodiments have 

55 been described as using multisync monitors, but in. the 
case of a monitor specifically designed for use .with a 
fixed frequency image signal, the resolution discrimina- 
tion section need not be provided, and a fixed value may 
be used in the product specification. 
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Further, typical screen sizes, such as 1/2 or 1/3 of 
the full screen size, may be predefined by fixed values. 

In the sixth, seventh, and eighth embodiments, the 
parallax control section A13 may be constructed to 
operate whenever the amount of translation, Dc, is cal- 
culated, or to operate only at the start of the apparatus 
operation and when there occurs a significant change in 
the binocular parallax of the input image. 

In the sixth, seventh, and eighth embodiments, pro- 
visions may be made so that the viewer enters adjust- 
ment commands using pushbutton switches or a remote 
controller only when he desires to adjust binocular par- 
allax. 

In the seventh embodiment, a time multiplexing 
system requiring the use of liquid-crystal shutters was 
used to produce a final stereoscopic image display, but 
it will be recognized that any other stereoscopic display 
method may be used, such as a parallax barrier method 
and a lenticular-lens method that does not require 
glasses. 

As is apparent from the descriptions so far given, 
since stereoscopic images are generated, or left-eye 
and right-eye images are automatically moved horizon- 
tally prior to image presentation, by considering the dis- 
play screen size of the stereoscopic TV apparatus, the 
resolution (frequency) of the input image signal, and the 
window size, the present invention has the advantage of 
being able to generate and present stereoscopic 
images looking natural and easy to view for the viewer. 

Claims 



a camera parameter determining section for a 
CG operator to determine said camera param- 
eters from an output of said camera parameter 
calculating means; 
5 whereby said projection transformation section 

generates said plurality of two-dimensional 
projection images by using said determined 
camera parameters. 

w 2. A stereoscopic CG image generating apparatus 
according to claim 1 , wherein said distance infor- 
mation extraction section extracts from said three- 
dimensional structural information the longest dis- 
tance or shortest distance between said cameras 

15 and said object. 

3. A stereoscopic CG image generating apparatus 
according to claim 2, wherein said longest distance 
or shortest distance between said cameras and 

20 said object can be manually set by said CG opera- 
tor as he desires. 

4. A stereoscopic CG image generating apparatus 
according to claim 2, wherein 

25 said CG operator designates a specific 

region on a CG display screen, and 

from the three-dimensional structural infor- 
mation within said specific region, said distance 
information extraction section extracts the longest 

30 distance or shortest distance between said cam- 
eras and said object. 



1. A stereoscopic CG image generating apparatus 5. 
comprising: 

35 

a projection transformation section for, based 
on three-dimensional structural information 
describing a three-dimensional shape of an 
object, generating a plurality of two-dimen- 
sional projection images as viewed from a plu- 40 
rality of cameras; 

a distance information extraction section for 
generating a distance between said object and 
said cameras; 

fusional range computing means for computing 45 
a binocular fusional range of a viewer viewing a 
screen of a stereoscopic image display device 
for displaying a stereoscopic image of said 
object, on the basis of pre-entered parameters 
consisting at least of the size of said screen so 
and a viewing distance between said screen 
and said viewer; 

camera parameter calculating means for calcu- 
lating conditions for parameters of said cam- 
eras, based on said binocular fusional range 55 
and on the output of said distance information 
extraction section, so that said object in its 
entirety can be brought within the binocular 
fusional range of said viewer; and 



A stereoscopic CG image generating apparatus 
comprising: 

a projection transformation section for, based 
on three-dimensional structural information 
describing a three-dimensional shape of an 
object, generating a plurality of two-dimen- 
sional projection images as viewed from a plu- 
rality of cameras; 

a rendering section for generating a CG image 
from the output of said projection transforma- 
tion section; 

a parallax map calculation section for generat- 
ing a distance image of an output image from 
the output of said projection transformation 
section or said rendering section and from said 
. three-dimensional structural information; 
fusional range computing means for computing 
a binocular fusional range of a viewer viewing a 
screen of a stereoscopic image display device 
for displaying a stereoscopic image of said 
object, on the basis of pre-entered parameters 
consisting at least of the size of said screen 
and a viewing distance between said screen 
and said viewer; 

pixel count calculating means for calculating, 
based on said binocular fusional range and on 
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the output of said parallax map calculating 
means, the number of pixels or the number of 
vertices of polygons or the number of center- 
points of polygons in the stereoscopic image 
lying within the binocular fusional range of said 5 
viewer; and 

a camera parameter determining section for 
determining camera parameters, by using the 
output of said pixel count calculating means, so 
that a region in the output image in which said 10 
viewer can achieve binocular fusion becomes 
larger in area than a prescribed value; 
whereby said projection transformation section 
generates said plurality of two-dimensional 
projection images by using said determined 15 
camera parameters. 

6. A stereoscopic CG image generating apparatus 
according to claim 5, wherein 

a CG operator designates a specific region 20 
on a display screen, and 

said pixel count calculating means calcu- 
lates the number of pixels in a CG image lying 
within the binocular fusional range of said viewer, 
on the basis of parallax information within said spe- 25 
cific region, the screen size of said stereoscopic 
image display device, and the viewing distance 
between said screen and said viewer. 

7. A stereoscopic CG image generating apparatus 30 
according to claim 6, wherein 

said pixel count calculating means detects a 
region of a subject in which said viewer can achieve 
binocular fusion, and 

a specific image processing section is 35 
included for applying specific image processing to 
image portions outside the binocular fusible region 
detected by said pixel count calculating means. 

8. A stereoscopic CG image generating apparatus 40 
according to claim 7, wherein said specific image 
processing section includes a clipping value deter- 
mining section for determining the location of a clip- 
ping plane so that CG images lying in portions 
outside said binocular fusible region will not be gen- 45 
erated. 

9. A stereoscopic CG image generating apparatus 
according to claim 7, wherein said specific image 
processing section includes a rendering section for so 
gradually changing an image contrast or transpar- 
ency of the subject over an area extending from a 
neighborhood of a binocular fusional limit of said 
viewer into a region where the binocular fusion can- 
not be achieved. 55 

10. A stereoscopic CG image generating apparatus 
according to claim 7, wherein said specific image 
processing section includes a fog effect parameter 



determining section for controlling the degree of a 
fog effect in such a manner as to increase the fog 
effect as a binocular fusional limit of said viewer is 
exceeded. 

11. A stereoscopic CG image generating apparatus 
according to claim 7, wherein said specific image 
processing section includes a focus ' parameter 
determining section for controlling the degree of a 
defocusing effect in such a manner as to increase 
the defocusing effect as a binocular fusional limit of 
said viewer is exceeded. 

12. For use in an windowing environment where one or 
more stereoscopic CG images are displayed simul- 
taneously, a stereoscopic CG image generating 
apparatus comprising: 

a projection transformation section for, based 
on three-dimensional structural information 
describing a three-dimensional shape of an 
object, generating a plurality of two-dimen- 
sional projection images as viewed from a plu- 
rality of cameras; 

a distance information extraction section for 
generating a distance between said object and 
said cameras; 

a window information management section for 
detecting the size of each individual window 
where a stereoscopic image is displayed and 
information about a video resolution or syn- 
chronization frequency of a display screen; 
a fusional range verification section, for calcu- 
lating from the output of said window informa- 
tion management section the size of a window 
on a stereoscopic image display device in 
which the two-dimensional projection images 
of said object are displayed as main images, 
and for calculating, from the size of jSaid win- 
dow, the output of said distance information 
extraction section, and a viewing distance of a 
viewer, camera parameters for each, individual 
window in order to bring the stereoscopic CG 
images within a binocular fusional range of said 
viewer; and 

a camera parameter determining section for, by 
using the output of said fusional range verifica- 
tion section, determining camera parameters 
for stereoscopic images to be displayed in each 
individual window; 

whereby said projection transformation section 
generates said plurality of two-dimensional 
projection images by using said = determined 
camera parameters. | >( 

13. A stereoscopic CG image generating apparatus 
according to claim 12 .wherein . .. . , i: 

said camera parameter determiing section 
changes ,only when the viewer points such window 
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to be adjusted , only camera parameters of the ster- 
eoscopic images corresponding to said pointed 
widow. 



from the size of said window, the output of said 
parallax calculation section, and the output of 
said viewing distance measuring section, and 
for computing an amount of parallax change 
5 necessary to bring said binocular parallax 

within a binocular fusional range of said viewer; 
and 

a parallax control section for, in accordance 
with the output of said optimum parallax deter- 

io mining section, translating the left-eye and 

right-eye images in horizontal directions so that 
the stereoscopic image displayed in each indi- 
vidual window will be brought within the binoc- 
ular fusional range of said viewer even if the 

15 size of each individual window or the synchro- 

nization frequency of the input image fre- 
quency changes due to an operation by said 
viewer. 



14. A stereoscopic TV apparatus comprising: 

. a parallax calculation section for calcu- 
lating binocular parallax from left-eye and right- 
eye images, and for calculating a maximum or 
minimum value of said binocular parallax; 
a viewing distance measuring section for 
measuring a viewing distance of a viewer; 
a resolution discrimination section for discrimi- 
nating the kind of an input image signal by 
detecting a synchronization frequency of said 
input image signal; 

an optimum parallax determining section for 
calculating the magnitude of binocular parallax 
of a displayed image from the output of said 
parallax calculation section, the output of said 
viewing distance measuring section, the output 
of said resolution discrimination section, and 
the size of a display screen, and for computing 
an amount of parallax change necessary to 
bring said binocular parallax within a binocular 
fusional range of said viewer; and 
a parallax control section for, in accordance 
with the output of said optimum parallax deter- 
mining section, translating the left-eye and 
right-eye images in horizontal directions so that 
stereoscopic images will be displayed within 
the binocular fusional range of said viewer 
even if the synchronization frequency of the 
input image frequency changes. 

15. In a system for simultaneously displaying a plurality 
of stereoscopic images in a windowing environ- 
ment, a stereoscopic TV apparatus comprising: 

a resolution discrimination section for discrimi- 
nating the kind of an input image signal by 
detecting a synchronization frequency of said 
input image signal; 

a window information management section for 
detecting the size of each individual window 
where a stereoscopic image is displayed; 
a viewing distance measuring section for 
measuring a viewing distance of a viewer; 
a parallax calculation section for calculating 
binocular parallax from left-eye and right-eye 
images, and for calculating a maximum or min- 
imum value of said binocular parallax; 
an optimum parallax determining section for 
calculating the actual size of each individual 
window from the output of said resolution dis- 
crimination section, the output of said window 
information management section, and the size 
of a display screen, calculating the magnitude 
of binocular parallax of an image calculated 



20 16. A stereoscopic TV apparatus according to claim 1 4 
or 15, wherein said viewing distance measuring 
section outputs as said viewing distance a recom- 
mended viewing distance for a stereoscopic TV. 

25 1 7. A stereoscopic TV apparatus according to claim 1 4 
or 15 . wherein said viewing distance measuring 
section measures viewing distances of a plurality of 
viewers, and outputs as said viewing distance an 
average value or weighted average value of said 
30 measured viewing distances or a maximum or min- 
imum value thereof. 

18. A stereoscopic TV apparatus according to claim 14 
or 15, wherein said parallax calculation section out- 

35 puts parallax as a predetermined fixed value. 

19. A stereoscopic TV apparatus according to claim 14 
or 15, wherein the output of said resolution discrim- 
ination section is set as a fixed value by assuming 

40 that only one kind of image is input. 

20. A stereoscopic TV apparatus according to claim 1 4 
or 15, wherein said resolution discrimination sec- 
tion detects the kind bf said input image signal 

45 among HDTV, EDTV and NTSC signals and com- 
puter-generated image signals of various resolu- 
tions, and identifies the resolution and aspect ratio 
of the detected kind of input image signal so that 
said optimum parallax determining section can rec- 

50 ognize an image size which is viewed by an opera- 

tor , on the display screen. 

21 . A stereoscopic TV apparatus according to claim 1 4 
or 15, wherein said parallax control section trans- 

55 lates the images in horizontal directions only when 
there occurs a significant change in the binocular 
parallax of the displayed images. 

22. A stereoscopic TV apparatus according to claim 14 



25 



30 
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or 15, wherein said parallax control section trans- 
lates the images in horizontal directions only when 
said viewer desires to adjust the binocular parallax 
and inputs an instruction using an instruction 
means such as a pushbutton switch or a remote 
controller. 

23. A stereoscopic TV apparatus according to claim 15, 
wherein said parallax control section translates the 
left-eye and right-eye images in horizontal direc- 
tions only for a window designated by said viewer 
as a window for which said viewer desires to adjust 
binocular parallax. 
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(54) Stereoscopic image generating apparatus and display therefor 

allax of displayed images is automatically brought within 
the viewer's binocular fusional range regardless of the 
size of a stereoscopic display used. 



(57) The stereoscopic CG image generating appa- 
ratus and a stereoscopic TV apparatus, has a projection 
transformation section which, based on three-dimen- 
sional structural information describing a three-dimen- 
sional shape of an object, generates a plurality of two- 
dimensional projection models as viewed from a plural- 
ity of viewpoints, a distance information extraction sec- 
tion which generates a camera-to-object distance 
information used for calculations in the projection trans- 
formation section, and a camera parameter determining 
section which, based on the output of the distance infor- 
mation extraction section, the screen size of a stereo- 
scopic image display device for displaying finally 
generated two-dimensional projection models, and a 
viewer's viewing distance, determines camera parame- 
ters so that stereoscopic CG images will be brought 
within the viewer's binocular fusional range. According 
to the thus constructed stereoscopic CG image generat- 
ing apparatus and stereoscopic TV apparatus, proper 
camera parameters (focal length or field of view, camera 
spacing, and converging point) are determined based 
on the camera-to-object distance information, the mag- 
nitude of parallax of the generated stereoscopic CG 
images on the display device (or in a window on the dis- 
play screen), and the viewing distance, so that easy-to- 
view stereoscopic CG images are automatically gener- 
ated regardless of the display size, and by horizontally 
translating left-eye and right-eye images, binocular par- 
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